57 research outputs found
Cross-Domain Labeled LDA for Cross-Domain Text Classification
Cross-domain text classification aims at building a classifier for a target
domain which leverages data from both source and target domain. One promising
idea is to minimize the feature distribution differences of the two domains.
Most existing studies explicitly minimize such differences by an exact
alignment mechanism (aligning features by one-to-one feature alignment,
projection matrix etc.). Such exact alignment, however, will restrict models'
learning ability and will further impair models' performance on classification
tasks when the semantic distributions of different domains are very different.
To address this problem, we propose a novel group alignment which aligns the
semantics at group level. In addition, to help the model learn better semantic
groups and semantics within these groups, we also propose a partial supervision
for model's learning in source domain. To this end, we embed the group
alignment and a partial supervision into a cross-domain topic model, and
propose a Cross-Domain Labeled LDA (CDL-LDA). On the standard 20Newsgroup and
Reuters dataset, extensive quantitative (classification, perplexity etc.) and
qualitative (topic detection) experiments are conducted to show the
effectiveness of the proposed group alignment and partial supervision.Comment: ICDM 201
Co-contributorship Network and Division of Labor in Individual Scientific Collaborations
Collaborations are pervasive in current science. Collaborations have been
studied and encouraged in many disciplines. However, little is known how a team
really functions from the detailed division of labor within. In this research,
we investigate the patterns of scientific collaboration and division of labor
within individual scholarly articles by analyzing their co-contributorship
networks. Co-contributorship networks are constructed by performing the
one-mode projection of the author-task bipartite networks obtained from 138,787
papers published in PLoS journals. Given a paper, we define three types of
contributors: Specialists, Team-players, and Versatiles. Specialists are those
who contribute to all their tasks alone; team-players are those who contribute
to every task with other collaborators; and versatiles are those who do both.
We find that team-players are the majority and they tend to contribute to the
five most common tasks as expected, such as "data analysis" and "performing
experiments". The specialists and versatiles are more prevalent than expected
by a random-graph null model. Versatiles tend to be senior authors associated
with funding and supervisions. Specialists are associated with two contrasting
roles: the supervising role as team leaders or marginal and specialized
contributions.Comment: accepted by JASIS
Unshifted Metastable He I* Mini-Broad Absorption Line System in the Narrow Line Type 1 Quasar SDSS J080248.18551328.9
We report the identification of an unusual absorption line system in the
quasar SDSS J080248.18551328.9 and present a detailed study of the system,
incorporating follow-up optical and NIR spectroscopy. A few tens of absorption
lines are detected, including He I*, Fe II* and Ni II* that arise from
metastable or excited levels, as well as resonant lines in Mg I, Mg II, Fe II,
Mn II, and Ca II. All of the isolated absorption lines show the same profile of
width km s centered at a common redshift as that of
the quasar emission lines, such as [O II], [S II], and hydrogen Paschen and
Balmer series. With narrow Balmer lines, strong optical Fe II multiplets, and
weak [O III] doublets, its emission line spectrum is typical for that of a
narrow-line Seyfert 1 galaxy (NLS1). We have derived reliable measurements of
the gas-phase column densities of the absorbing ions/levels. Photoionization
modeling indicates that the absorber has a density of and a column density of , and is located at
pc from the central super-massive black hole. The location of the absorber, the
symmetric profile of the absorption lines, and the coincidence of the
absorption and emission line centroid jointly suggest that the absorption gas
is originated from the host galaxy and is plausibly accelerated by stellar
processes, such as stellar winds \zhy{and/or} supernova explosions. The
implications for the detection of such a peculiar absorption line system in an
NLS1 are discussed in the context of co-evolution between super-massive black
hole growth and host galaxy build-up.Comment: 28 pages, 16 figures; accepted for publication in Astrophysical
Journa
Conceptual Text Region Network: Cognition-Inspired Accurate Scene Text Detection
Segmentation-based methods are widely used for scene text detection due to their superiority in describing arbitrary-shaped text instances. However, two major problems still exist: (1) current label generation techniques are mostly empirical and lack theoretical support, discouraging elaborate label design; and (2) as a result, most methods rely heavily on text kernel segmentation which is unstable and requires deliberate tuning. To address these challenges, we propose a human cognition-inspired framework, termed, Conceptual Text Region Network (CTRNet). The framework utilizes Conceptual Text Regions (CTRs), which is a class of cognition-based tools inheriting good mathematical properties, allowing for sophisticated label design. Another component of CTRNet is an inference pipeline that, with the help of CTRs, completely omits the need for text kernel segmentation. Compared with previous segmentation-based methods, our approach is not only more interpretable but also more accurate. Experimental results show that CTRNet achieves state-of-the-art performance on benchmark CTW1500, Total-Text, MSRA-TD500, and ICDAR 2015 datasets, yielding performance gains of up to 2.0%. Notably, to the best of our knowledge, CTRNet is among the first detection models to achieve F-measures higher than 85.0% on all four of the benchmarks, demonstrating remarkable consistency and stability
Model-based analysis uncovers mutations altering autophagy selectivity in human cancer
Autophagy can selectively target protein aggregates, pathogens, and dysfunctional organelles for the lysosomal degradation. Aberrant regulation of autophagy promotes tumorigenesis, while it is far less clear whether and how tumor-specific alterations result in autophagic aberrance. To form a link between aberrant autophagy selectivity and human cancer, we establish a computational pipeline and prioritize 222 potential LIR (LC3-interacting region) motif-associated mutations (LAMs) in 148 proteins. We validate LAMs in multiple proteins including ATG4B, STBD1, EHMT2 and BRAF that impair their interactions with LC3 and autophagy activities. Using a combination of transcriptomic, metabolomic and additional experimental assays, we show that STBD1, a poorly-characterized protein, inhibits tumor growth via modulating glycogen autophagy, while a patient-derived W203C mutation on LIR abolishes its cancer inhibitory function. This work suggests that altered autophagy selectivity is a frequently-used mechanism by cancer cells to survive during various stresses, and provides a framework to discover additional autophagy-related pathways that influence carcinogenesis
Extreme Coronal Line Emitters: Tidal Disruption of Stars by Massive Black Holes in Galactic Nuclei?
Tidal disruption of stars by supermassive black holes at the centers of
galaxies is expected to produce unique emission line signatures, which have not
yet been explored adequately. Here we report the discovery of extremely strong
coronal lines from [Fe X] up to [Fe XIV] in a sample of seven galaxies
(including two recently reported cases), that we interpret as such signatures.
This is the first systematic search for objects of this kind, by making use of
the immense database of the Sloan Digital Sky Survey. The galaxies, which are
non-active as evidenced by the narrow-line ratios, show broad emission lines of
complex profiles in more than half of the sample. Both the high ionization
coronal lines and the broad lines turn out to be fading on time scales of years
in objects observed with spectroscopic follow-ups, suggesting their transient
nature. Variations of inferred non-stellar continua, which have absolute
magnitudes of at least -16 to -18 mag in the g band, are also detected in more
than half of the sample. These extreme coronal line emitters reside in sub-L_*
disk galaxies (-21.3 < M_i < -18.5) with small stellar velocity dispersions.
The sample seems to form two distinct types based on the presence or absence of
the [Fe VII] lines, with the latter having relatively low luminosities of [O
III], [Fe XI], and the host galaxies. These characteristics can most naturally
be understood in the context of transient accretion onto intermediate mass
black holes at galactic centers following tidal disruption of stars in a
gas-rich environment. We estimate the incidence of such events to be around
10^-5 per year for a galaxy with -21.5 < M_i < -18.5.Comment: 32 pages, 12 figures, ApJ accepted, typos correcte
Enhancement of Diffusion Assisted Bonding of the Bimetal Composite of Austenitic/Ferric Steels via Intrinsic Interlayers
We investigate the effect of the intrinsic interlayers on the diffusion assisted bonding properties of the austenitic steel (stainless steel 316L) and ferric steels (Low-carbon steel Q345R) in a hot rolling process by molecular dynamics simulations and experiment. The introduction of an intrinsic interlayer (Cr or Ni) widens the diffusion region, leading to enhancement of bonding. The thickness of the diffusion region enlarges with an increase of temperature, with an enhancement factor of 195% and 108%, for Cr and Ni interlayer, respectively, at the temperature of 1800 K. Further diffusion analysis reveals the unsymmetrical diffusion near the interface. Our experimental investigation evidenced our computation discovery
- …